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Abstract. We consider a white noise perturbation of dynamics in the 
neighborhood of a heteroclinic network. We show that under the loga- 
rithmic time rescaling the diffusion converges in distribution in a special 
topology to a piecewise constant process that jumps between saddle 
points along the heteroclinic orbits of the network. We also obtain 
precise asymptotics for the exit measure for a domain containing the 
starting point of the diffusion. 



1. Introduction 

In this note, we study small noise perturbations of a smooth continuous 
time dynamical system in the neighborhood of its heteroclinic network. 

The deterministic dynamics is defined on M'^ as the flow (S'*)tg]g generated 
by a smooth vector field b : M"^ — )• M*^, i.e. S^xq is the solution of the initial- 
value problem 

(1.1) x{t) = b{xit)), 

x{0) = Xq. 

We assume that the vector field b generates a heteroclinic network, that is a 
set of isolated critical points connected by heteroclinic orbits of the flow S. 
Heteroclinic orbits arise naturally in systems with symmetries. Moreover, 
they are often robust under perturbations of the system preserving the sym- 
metries, see the survey |9j and references therein, for numerous examples 
and a discussion of mechanisms of robustness. 

We consider the system (jl.ip perturbed by uniformly elliptic noise: 

(1.2) dXeit) = b{XS))dt + ea{XS))dW{t), 

^e(O) =xo, 

where is a standard d-dimensional Wiener process, cf{x) is a nondegener- 
ate matrix of diffusion coefficients for every x, and e > is a small number. 
The initial point xq is assumed to belong to one of the heteroclinic orbits. 

Our principal result on the vanishing noise intensity asymptotics can be 
informally stated as follows. 

Theorem. Under some technical nondegeneracy assumptions, as e — )• 0, 
the process {X^{t\n{e~^)))t>Q converges in distribution in an appropriate 
topology to a process that spends all the time on the set of critical points and 
jumps instantaneously between them along the heteroclinic trajectories. 
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In fact, we shall provide much more detailed information on the limit- 
ing process and describe its distribution precisely. Thus, our result pro- 
vides a unified and mathematically rigorous background for the existing 
phenomenological studies, see e.g. [1]. In particular, we shall see that in 
many cases the limiting process is not Markov. The precise description of 
the limiting process allows to obtain asymptotics of the exit distribution for 
domains containing the starting point. This asymptotic result is of a dif- 
ferent kind than the one provided by the classical Freidlin-Wentzell (FW) 
theory, see [5]. In fact, it allows to compute precisely the limiting probabil- 
ities of specific exit points that are indistinguishable from the point of view 
of the FW quasi-potential. 

To prove the main result, we have to trace the evolution of the process 
along the heteroclinic orbits and in the neighborhood of hyperbolic critical 
points. The latter was studied in [8] and [2], where it was demonstrated 
that in the vanishing diffusion limit, of all possible directions in the unstable 
manifold the system chooses to evolve along the invariant curve associated 
to the highest eigenvalue of the linearization of the system at the critical 
point, and the asymptotics for the exit time was obtained. However, these 
results are not sufficient for the derivation of our main theorem, and more 
detailed analysis is required. 

The paper is organized as follows. In Section [21 we study a simple exam- 
ple of a heteroclinic network. In Section [3l we give non-rigorous analysis of 
the behavior of the diffusion in the neighborhood of one saddle point. The 
general setting and the main result on weak convergence are given in Sec- 
tion [3 To state the result we need to define in what sense the convergence 
is understood. Therefore, we begin our exposition in Section H] with a brief 
description of the relevant metric space, postponing the proofs of all tech- 
nical statements concerning the metric space till Section [T2l In Section [6l 
we state a result on the exit asymptotics and derive it from the main result 
of Section [5l In Section [71 we give the statement of the central technical 
lemma and use it to prove the main result. The proof of the central lemma 
is split into two parts. They are given in Section [8] and Section [9l respec- 
tively. Proofs of some auxiliary statements used in Section [HI are given in 
Section [TTl Section [10] is devoted to an informal discussion of implications 
of our results and their extensions. 
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dynamics by Valentin Afraimovich (see his paper [lOJ). I am grateful to him 
for several stimulating discussions this work began with, as well as for his 
words of encouragement without which it could not have been finished. I 
am also grateful to NSF for their support via a CAREER grant. 
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2. A SIMPLE EXAMPLE 

Here we recall a simple example of a heteroclinic network described in [9j. 
If oi < 0, the deterministic cubic system defined by the drift of the following 
stochastic system 

dXs,i = + aiXl^ + 02X^2 + a^Xl^)dt + edWi, 

(2.1) dX,^2 = ^2,^(1 + aiXl^ + 02X^3 + a^Xl^)dt + edW2, 
dX,,3 = X3,e(l + aiX^g + 02^2 1 + a3Xl^)dt + edW^ 

has 6 critical points 

z± = (±V-l/ai,0,0), 
4 = (0,±v'-l/ai,0), 
Z3± = (0,0,±7-l/ai). 
The matrix of the linearization of the system at z'^ is given by 

diag -2, , , 

and the linearizations at all other critical points are can be obtained from 
it using the symmetries of the system. We see that if 

(2.2) as > oi and 02 < ai, 

then all the critical points are saddles with one unstable direction corre- 
sponding to the eigenvalue (ai — az)/ai. It is shown in [9] that system ()2.ip 
admits 12 orbits connecting zf' to z^, to z^ , and to z^ . Each of 
these orbits lies entirely in one of the coordinate planes. 

Let us equip (|2.ip with an initial condition xq on one of the 12 heteroclinic 
connections, say, on the one connecting z^ to denoted by z^ — > z^. 

The theory developed in this paper allows to describe precisely the limit- 
ing behavior of the process as e — > 0. Namely, the process X^ will stay close 
to the heteroclinic network, moving mostly along the heteroclinic connec- 
tions between the saddle points. At each saddle point it spends a time of or- 
der ln(e~-'^). More precisely, the process defined by Zs{t) = X^{t\n{e~'^)) 
converges to a process that jumps from xq to z^ instantaneously along the 
heteroclinic connection zj*" — > z^, sits at z^ for some time, then chooses one 
of the orbits z^ — > z^ and z^ — > z^, and jumps along that orbit instanta- 
neously, spends some time in the endpoint of that orbit and then chooses a 
new outgoing orbit to follow, etc. However, the details of the process depend 
crucially on the eigenvalues of the linearization at the critical point. 

If at each saddle point the contraction is stronger than expansion, i.e., if 

ai — 02 ^ ai — 03 
ai ai ' 

which is equivalent to 02 + ^3 < 2ai , then the system exhibits loss of memory 
and the sequence of saddle points visited by the limiting process is a standard 
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random walk on the directed graph formed by the network, i.e. a Markov 
chain on saddle points that at each point chooses to jump along one of the 
two possible outgoing connections with probability 1/2. 

If the expansion is stronger than the contraction, i.e., 02 + 03 > 2ai, then 
at Z2, the first saddle point visited, the process still chooses each of the 
two possible next hctcroclinic connections with probability 1/2. However, 
if it chooses — > , then it will cycle through , zt^ and never visit 

any other critical points. If it chooses — > z^, then it will cycle through 
zj^, z^, and never visit any other critical points. This situation is strongly 
non-Markovian, because the choice the system makes at z^ at any time is 
determined by its choice during the first visit to that saddle point. 

The case where 02 + 03 = 2ai combines certain features of the situations 
described above. The limiting random saddle point sequence explores all 
heteroclinic connections, but it makes asymmetric choices determined by its 
the history, also producing non-Markov dynamics. 

Besides the probability structure of limiting saddle point sequences, our 
main result also provides a description of the random times spent by the 
process at each of the saddles it visits. 

3. Nonrigorous analysis of a linear system 

In this section we consider the diffusion near a saddle point in the simplest 
possible case: 

dX^^x = XiX^,idt + edWi, 

dX^^2 = \2Xe,2dt + £dW2 

with initial conditions X^^i = 0, X^^2 = 1- Here Ai > > A2, and Wi,W2 
are i.i.d. standard Wiener processes. 

Let us study the exit distribution of X^: for the strip {{xi,X2) ■ \xi\ < 1}. 
In other words, we are interested in the distribution of X^{ts), where = 
inf{t : = 1}. Duhamel's principle implies 

= £6^1* f\-^^'dWl{s). 

Jo 

The integral in the r.h.s. converges a.s. to a centered Gaussian r.v. A^i, so 
that for large t, 

Therefore, for the exit time t^, we have to solve 
so that 

T,^^\n{e\N^\)-\ 

So, the time spent by the diffusion in the neighborhood of the saddle point 
is about ln(£:~^). 
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On the other hand, 

X,,2(t) = e^2* + e f e^^^'-'^dW2{s). 
Jo 

and the integral in the r.h.s. converges in distribution to A''2, a centered 
Gaussian r.v. Plugging the expression for into this relation, we get 

{eN2, A2 < -Ai, 

£-A2/Ai|^^|-A2/Ai^ A2>-Ai, 
e{\Ni\+N2), A2 = -Ai. 

Therefore, when the contraction is stronger than expansion (A2 < — Ai), 
the limiting exit distribution is centered Gaussian. In the opposite case 
(A2 > — Ai), the limiting exit distribution is strongly asymmetric and con- 
centrated on the positive semiline reflecting the fact that the initial condition 
-^£,2(0) = 1 was positive and presenting a strong memory effect. In the in- 
termediate case (— A2 = Ai), the limit is the distribution of the sum of a 
symmetric r.v. N2 and a positive r.v. \Ni\, and the resulting asymmetry 
also serves as a basis for a certain memory effect. 

In general, the asymmetry in the exit distribution means that at the next 
visited saddle point the choices of the exit direction will not be symmetric, 
thus leading to non-Mar kovian dynamics. Notice that three types of behav- 
ior for the linear system that we just derived correspond to the three types 
of cycling through the saddle points in the example of Section [2l 

One of the main goals of this paper is the precise mathematical meaning 
of the approximate identities of this section and their generalizations to 
multiplicative white noise perturbations of nonlinear dynamics in higher 
dimensions. 

4. Convergence of graphs in space-time 

The main result of this paper states the convergence of a family of con- 
tinuous processes to a process with jumps. This kind of convergence is 
impossible in the traditionally used Skorokhod topology on the space D of 
processes with left and right limits, since the set of all continuous functions 
is closed in the Skorokhod topology, see [3]. 

In this section we replace D by another extension of the space of con- 
tinuous functions. This extension allows to describe not only the fact of 
an instantaneous jump from one point to another, but also the curve along 
which the jump is made. We shall also introduce an appropriate topology 
to characterize the convergence in this new space. 

It is interesting that in his classical paper [11] Skorokhod introduced sev- 
eral topologies for trajectories with jumps. Only one of them is widely 
known as the Skorokhod topology now. However, the construction that we 
are going to describe here did not appear either in [11], or anywhere else, to 
the best of our knowledge, at least in the literature on stochastic processes. 
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We consider all continuous functions ( "paths" ) 

7 : [0, 1] [0, oo) X 

such that 7*^(5) is nondecreasing in s. Here (and often in this paper) we use 
superscripts to denote coordinates: 7 = (7^,7"^, . . . ,7'^). 

We say that two paths 71 and 72 are equivalent, and write 71 ~ 72 if there 
is a path 7* and nondecreasing surjective functions Ai, A2 : [0, 1] — ?> [0, 1] with 
7i(s) = 7* o Ai(s) and 72(5) = 7* o A2(s) for all s G [0, 1], where o means 
the composition of two functions. (These are essentially reparametrizations 
of the path 7* except that we allow Ai, A2 to be not strictly monotone.) In 
Section [12] we shall prove the following statement: 

Lemma 4.1. The relation ~ on paths is a well-defined equivalence relation. 

Any non-empty class of equivalent paths will be called a curve. We denote 
the set of all curves by X. Our choice of the equivalence relation ensures 
that each curve is a closed set in sup- norm (see Section [T2]) . and we shall be 
able to introduce a metric on X induced by the sup-norm. 

Since each curve in X is nondecreasing in the zeroth coordinate which 
plays the role of time, it can be thought of as the graph of a function 
from [0, r] to for some nonnegative T. However, any value t E [0, T] 
can be attained by a path's "time" coordinate for a whole interval of values 
of the variable parametrizing the curve, thus defining a curve in {t} x M'^, 
which is interpreted as the curve along which the jump at time t is made. 

We would like to introduce a distance in X that would be sensitive to 
the geometry of jump curves, but not to their parametrization. So, for two 
curves ri,r2 G X, we denote 

(4.1) /3(ri,r2)= inf sup |7i(s) -72(5)1, 

7lGri,72Gr2 sg[o,l] 

where | ■ | denotes the Euclidean norm in [0, T] x M*^. 

Theorem 4.1. (1) The function p defined above is a metric on X. 
(2) The space (X, />) is Polish (i.e. complete and separable). 

We postpone the proof of this statement to Section [T2l 

Naturally, any continuous function / : [0,T] M.'^ defines a path 7j by 

(4.2) 7/(0 = (tT, f\tT), f{tT), . . . , /-^(tr)) , t G [0, 1], 

and a curve Tf that is the equivalence class of 7/. 

The following result shows that the convergence of continuous functions in 
sup-norm is consistent with convergence of the associated curves in metric p. 

Lemma 4.2. Let (/n)nGN o,nd g be continuous functions on [0, T] for some 
T > 0. Then 

sup \fn{t) - g{t)\ ^ 0, n^oo 

tG[0,T] 
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is necessary and sufficient for 

/'(r/„,rg)^0, n^oo. 

The proof of this lemma is also given in Section [T2j In fact, it can be 
extended to describe the convergence of graphs of functions with varying 
domains. 

We shall need the following notion in the statement of our main result. 
An element F of X is called piecewise constant if there is a path 7 £ F, a 
number fc G N and families of numbers 

= So < Sl < . . . < S2fe < S2fc+1 = 1, 

= to < ti < . . . < tk-i < tk, 

and points 

yi,...,yk eM^ 

such that (7^(5), . . . ,7'^(s)) = yj for s G [s2j-i, S2j], j = l,...,k, and 
7*^(5) = tj for s € [s2j, S2j+i], j = 0, . . . , k. A piecewise constant T describes 
a particle that sits at point yj between times tj^i and tj, and at time tj 
jumps along the path 7^ = (7-*^, . . . ,7'^)|[s2j,s2j+i]- natural to identify T 
with a sequence of points and jumps, and we write 

(4.3) r = (70,2/1, Atl, 71, y2,At2, 72, . . . ,yfc, Atfc,7fc), 

where Atj = tj — tj-i denotes the time spent by the particle at point yj. 

5. The setting and the main weak convergence result 

In this section we describe the setting and state the main result. The 
conditions of the setting and possible generalizations are discussed in Sec- 
tion [in31 

We assume that the vector field 6 : R"' — )• R'^ is C^-smooth, and the dx d- 
matrix valued function a is also C^. We assume that for each xq, the flow 
S^xq associated to the system (jl.ip is well-defined for all t G M (including 
negative values of t). We assume that b admits a heteroclinic network of a 
special kind that we proceed to describe. 

We suppose that there is a finite or countable set of points (zj)jgC) where 
C = NorC = {l,..., N} for some > 1, with the following properties. 

(i) For each i £ C, there is a neighborhood Ui of Zi and a d x d matrix 
Ai such that 

b{x) = Ai{x - Zi) + Qi{x), xeUi, 

where |(5i(x)| < Cj|x — Zjp, for a constant Cj and every x £ Ui. 
In particular, Zi is a critical point for b, since b{zi) = 0. Moreover, 
we require that 5* is conjugated on Ui to a linear dynamical system 
y = Aiy by a C^-diffeomorphism fi satisfying fi{zi) = 0. This 
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means that for any xq £ Ui, there is to = tQ{xo) > such that for all 
t G (—to, to), 

^MS'xo) = AfiiS'xo). 
at 

(ii) For each i € C, the eigenvalues Aj^i, . . . , Xi^d of Ai are real and simple, 
we also assume that there is an integer with 2 < Vi < d such that 

(5.1) A,,i > . . . > Ai,^^_i > > Ai,^^ > . . . > Ai,rf. 

These requirements mean, in particular, that each Zi is a hyperbolic fixed 
point (saddle) for the dynamics. The Hartman-Grobman theorem (see The- 
orem 6.3.1 in [7j) guarantees the existence of a homeomorphism conjugating 
the flow generated by vector field b to linear dynamics. Our condition (i) 
imposes a stronger requirement for this conjugation to be C^. This re- 
quirement is still often satisfied as follows from the Sternberg linearization 
theorem for hyperbolic fixed points with no resonances, see Theorem 6.6.6 
in [7]. In particular, the cubic system of Section [2] is C°°-conjugated to a 
linear system at each saddle point for typical values of its parameters. 

We also want to make several assumptions on orbits of the flow connect- 
ing these saddle points to each other. First, we denote by Vi^i, . . . ,Vi^d the 
unit eigenvectors associated with the eigenvalues Aj^i, . . . ,Xi^d- The hyper- 
bolicity (and, even more straightforwardly, the conjugation to a linear flow) 
implies that for every i € C there is a d — v + 1-dimensional C^-manifold Wf 
containing Zj such that limt^+oo S^x = Zi for every x G Wf (i.e. W/ is the 
stable manifold associated to Zj.) For each i the unstable manifold of Zi is 
also well-defined. 

However, it is known, see [8] and [2], that if the initial data for the sto- 
chastic fiow are close to the stable manifold then after passing the saddle Zi 
the solution evolves mostly along the invariant manifold associated to the 
highest eigenvalue of A^. So, what we need is the curve 7^ G containing Zi, 
tangent to Vi^i at Zi, and invariant under the fiow. Of course, the intersection 
of 7j with Ui is well-defined and coincides with ff^{span{vi^i) D fi{Ui)). 

For each i G C we denote gi = and set 

qf = gi{±RiVi^i), 
where the numbers {Ri)i(^c ^-I'e chosen so that 

Ui = [x: max |P^'Vi(x)| <fiA CC/i, 

k=l,...,d J 

and these sets are mutually disjoint. Here, for any y G M'^, the number P'^'^y 
is defined by 

d 

y = Y^iP^'^vKk, 

k=l 

and denotes the k-th coordinate of y in the coordinate system defined by 
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We denote the orbits of qf by 7^^, and assume that for each i G C, there 
are numbers n^(i) € C such that 

hm S^qf = 2:„±(j). 

This means that the curves are heterochnic orbits connecting the saddle 
point Zi to saddle points Zn±(i)- We do not prohibit these orbits to be 
homoclinic and connect Zi to itself, i.e. the situations where n^{i) = i are 
allowed. 

For any i £ C we define 

(5.2) hf = mf{t: 5*gf G C/„±(,)}, 

(5.3) xf = S^hf. 

so that hf is the time it takes to travel from qf to the neighborhood of the 
next saddle, and xf is the point of entrance to that neighborhood. 
Our first nondegeneracy assumption is that for all i G C, 

(5.4) ^'"^^^^•^"^«/n±w(^?)/0, 

which means that each heteroclinic orbit ^f has a nontrivial component 
in the direction of the v„± (j\ y , as it approaches z„± a) . Although this 

condition holds true for all systems of interest (e.g., the system considered 
in Section [2]) , it is easy to adapt our reasoning to the situations where other 
components of the projection of fn±{i) (xf) on the stable directions dominate. 

We shall also need a nondegeneracy condition on the linearization of (II. ip 
along jf. For each x G M'^ we consider the fundamental matrix ^x{') solving 
the equation in variations along the orbit (5*x)t>o: 

j^<^x{t) = Db{S'x)^x{t), t>0, 
^x{0) = I. 

For all i,j we denote 

vf^=^^^ihf){DMqf))-\j. 
The technical nondegeneracy assumption on ^ that we need is: 
(5.5) 

P"^«''=Z)/„±(,)(x±)z;J. / 0, z G C, j G {2,1.,}, k = 

Again, we work with this condition since it holds true for any system of 
interest, but it is easy to adapt our reasoning to the situations where this 
condition is violated. 

To formulate our main theorem we need a notion of an entrance- exit 
map describing the limiting behavior of in the neighborhood of a saddle 
points, namely, the asymptotics of the random entrance-exit Poincare map 
as e — 7- 0. 



10 



YURI BAKHTIN 



We denote the set of all probability Borel measures on by V{M.'^). 
Let us denote by luj the set of all triples {x,a, fi) where 



(1) xGUiDW^ satisfies fi{x) / 0; 

(2) aG (0,1]; 

(3) fi G with 

l^{cP: PlDfi{x)<Pj^O} = l, ifQ<l, 

fi{<l) : PfDfi{x)(t) / 0} = 1, if a < 1 and > 2. 

This set will be used to describe the initial condition for equation (jl.2p : 
Xe(0) = X + e"0£, where the distribution of weakly converges to /x as 
e ^ 0. 

We also define 

Out = {(t,p,x,/3,F) : t G (0,oo), p G [0,1], x G M'^, /3 G (0,1], F G P(M'^)} 



Here, the numbers p± define the limiting probabilities of choosing each of 
the two branches of the invariant curve associated with the highest eigen- 
value of the linearization at the saddle point; x± are points on these orbits 
that serve as entrance points to neighborhoods of the next saddle points; 
t± are the times it takes to reach these points under the proper (logarith- 
mic) renormalization; (3 is the scaling exponent so that the exit distribution 
(serving as the entrance distribution to the next saddle's neighborhood) 
takes the form x± + s^ips, where the distribution of V'e converges to 
or F- depending on which of the two branches was chosen. 

It is possible (see Lemma l7.ip to give a precise description of the asymp- 
totic behavior of the diffusion in the neighborhood of each saddle point in 
terms of an appropriate entrance-exit map, i.e. a map that for each saddle 
point computes a description of the exit parameters in terms of the entrance 
parameters: 



For an entrance-exit map (^j)jgc = {'^i,-,'^i,+)iec we shall denote its 
components by U = ti^±, pi^±, Xi^±, /3i = I3i^±, F±^i. 

Suppose xq belongs to one of heteroclinic orbits of the network. A se- 
quence z = (6*0, -Zii , 6*1, , 6*2 . . . , 6k-i,Zi^, , 9k) is called admissible for xq if 

(1) 6o is the positive orbit of xq with limj_j.oo S^xq = Zi^; 

(2) for each j G {1, . . . , k}, Oj = 7+ or 0j = 7^; 

(3) for each j £ {1, . . . , k — 1}, 



and 



Outi = {((t_,p_,x„,^_,F. 



^_),(t+,p+,x+,/3+,F+)) G Out^ : 
t- = U, x± = xf, p^+p^ = 1, /3_ = /3+}. 



: luj Outj, i G C. 
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(5.9) {tj,pj,Xj,aj,^ij) 



The number k = k{z) is called the length of z. 

Our main limit theorem uses entrance-exit maps to assign limiting prob- 
abilities to admissible sequences. Let us proceed to describe this procedure. 

With each admissible sequence z we associate the following sequence: 

(5.6) C(z) = ((xo, ao,/^o),(ii, Pi, ^i,ai,/ii), •••,(**:, Pfc,^A;,afc,Mfc))- 
Here xq = S^^^'^^xq, oq = 1, and 

(5.7) MO = Law |^$xo(i>o)) J^'^^"^ <i>-^{s)a{S'xo)dW{s)j , 
where 

(5.8) t(xo) = mf{t > : S^xq £Un} + 1. 

We add 1 in the r.h.s. so that the distribution is nondegenerate (and the 
maps ^j^^±(3;o, ao, jUo) are well-defined) even if xq G C/j^. All other entries 
in (I5.6P are obtained according to the following recursive procedure. For 
each j, 

1'i„+(ij-i, "j-i, Mj-i), = 
_1'i^,_(xj_i,aj„i,Aii-i), Oj =7-. 

The numbers ti = ti{z), . . . ,tk = ifc(z) defined above play the role of time, 
and the admissible sequence z can be identified with a piecewise constant 
trajectory r(z) G X: 

r(z) = (6*0, Zi^,ti,9i, Zi2,t2,02, . . . , Zii^,tk,Ok)- 

The numbers pi = pi{z), . . . ,pk = Pki^) defined in (15 .Gh play the roles of 
conditional probabilities, and we denote 

(5.10) 7r(z) = pi(z)p2(z) . . .pfc(z). 

The set of all admissible sequences for xq has the structure of a binary 
tree. The natural partial order on it is determined by inclusion. We say 
that a set L of admissible sequences for xq is free if no two sequences in L 
are comparable with respect to this partial order. If additionally, for any 
sequence not from L, it is comparable to one of the sequences from L, the 
set L is called complete. In the language of graph theory, a complete set is 
a section of the binary tree. 

It is clear that for any free set L, 7r(L) < 1, where tt{L) = "^^.ei ^(^)- ^ 
free set L is called conservative if tt{L) = 1. Every complete set is finite and 
conservative. 

We are ready to state our main result now. 

Theorem 5.1. Under the conditions stated above there is an entrance-exit 
map ^ with the following property. 

Let Xq belong to one of the heteroclinic orbits of the network. For each 
e > define a stochastic process by 

(5.11) Z,{t) = Xe{tHe-^)), t>0, 
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where is the strong solution of (jl.ip with initial condition X^{0) = xq. 

For any conservative set L of xo-admissible sequences, there is a family 
of stopping times {T^)s>o such that the distribution of the graph Tz^(t),t<Te 
converges weakly in (X, p) to the measure M^q^l concentrated on the set 

{r(z) : zGL}, 

and satisfying 

(5.12) M,„,i{r(z)} = ^(z), zeL, 

where '7r(z) is defined via ^ in (jS.lOp . 

For any entrance-exit map the family of conservative sets includes all 
finite complete sets, so that the content of Theorem 15.11 is nontrivial. 

Importantly, we actually construct the desired entrance-exit map ^ in the 
proof. This allows to study the details of the limiting process in Section [lOl 
At this point let us just mention that the sequence of saddles visited by 
the limiting process can be Markov or non-Markov depending on the eigen- 
values of the linearizations at the saddle points. We discuss this memory 
effect and some other implications and possible extensions of Theorem 15.11 
in Section [lOl 

6. Exit measure asymptotics 

We shall now apply Theorem l5.1l to the exit problem along a heteroclinic 
network, and formulate a theorem that, in a sense, gives more precise infor- 
mation on the exit distribution than the FW theory. We assume that there 
is a domain D C with piecewise smooth boundary such that xq ^ D. 
The FW theory implies that, as e — ?> 0, the exit measure for the process 
started at xq concentrates at points y € dD that provide the minimum 
value of the so called quasi-potential V{xo,y). Since for all the points that 
are reachable from xq along the heteroclinic network, the quasi-potential 
equals 0, we conclude that in the case of heteroclinic networks, the exit 
measure asymptotically concentrates at the boundary points that can be 
reached from xq along the heteroclinic network. However, this approach 
does not allow to distinguish between the exit points while ours allows to 
determine an exact limiting probability for each exit point. 

We take a point xq on one of the heteroclinic orbits of the network and 
denote by L{xo, D) the set of all the admissible sequences z (of any length k) 
for Xq such that the last curve 9k of the sequence intersects dD transversally 
at a point g(z) (if there are several points of intersection we take the first 
one with respect to the natural order on Ok), and 9j does not intersect dD 
for all j < k. Let 

Te{xo,D) = mf{t:Xeit) G dD}. 

The distribution of X£{t£{xq, D)) is concentrated on dD. 
For each z G L{xo,D), one can define 7r(z) via (|5.10p . 
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Theorem 6.1. For the setting described above, if the set L{xq,D) is con- 
servative, then the distribution of X{t£{xo, D)) converges weakly, as e —?■ 0, 
to 

Pxo,D= 7r(z)5g(2). 

zeL{xo,D) 

Proof. Let us use Theorem lS.ll to choose the times (Te)e>o providing conver- 
gence of the distribution of ^z^{t),t<Ts to ^xo,l{xo,d)- The theorem fohows 
since Xs{ti;{xo, D)) is a functional of ^Zeit),t<T^i continuous on the support 
of M^o.iCa^o,^')- ^ 

Remark 6.1. Notice that for different sequences z and z' it is possible to 
have q{z) = q{z') so that an exit point can accumulate its limiting proba- 
bility from a variety of admissible sequences. 

Remark 6.2. The behavior of the system up to t^^xqjD) is entirely de- 
termined by the drift and diffusion coefficients inside D. Therefore, there 
is an obvious generalization of this theorem for heteroclinic networks in a 
domain, where one requires the invariant manifolds associated to the highest 
eigenvalue at a critical point to connect that critical point either to another 
critical point, or to a point on dD. An advantage of that theorem is that one 
does not have to specify the (irrelevant) coefficients of (|1.2p outside of D. 
We omit the precise formulation for brevity. 

Remark 6.3. In the case of nonconservative L{xq,D), the limit theorem 
is harder to formulate. In the limit, the exit happens along the sequences 
belonging to L = L{xo,D) with positive probability vr(L) < 1. With proba- 
bility 1 — tt{L), the exit happens in a more complicated way (and in a longer 
than logarithmic time) depending on the details of the driving vector field. 

7. Proof of Theorem 15.11 

We begin with the central lemma that we need in the proof. It has a 
lengthy statement, and after formulating it, we also give a brief informal 
explanation. 

For each i G C we introduce and via 

(7.1) Ut = {x£U: P''^fi{x) = ±R,}. 

Lemma 7.1. For each i (z C, there is a map 

= {{ti-,Pi-,Xi-,/3i-,Fi-), (tj,+,pi,+ ,Xi,+,/3j,+,Fj,+)) : In^ OuU 

with the following property. 

Take any {x,a,fi) € Inj and any family of distributions (/^e)e>o in 'P(M'^) 
with fi;; ^ fi as e — > 0. For each e > 0, consider the solution of (II. 2p 
with initial condition 



(7.2) 



X,(0) = x + e"<^„ 
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where 

(7.3) Law((/>£) = e > 0, 

and define a stopping time 

Tont,e = mf{t > : X,{t) G C/+ U Ur}, e > 0, 
and two events 

A,±,s = {Xe{Tont,e) G U^}, S > 0. 

Then 

(1) Ase^O, 

ti,±[x,a,n). 



In(e-i) 

(2) ^se^O, 

(3) There is a family of random vectors (V'j,e)e>o such that 

(a) /or every e > 0, on ^j,±,£ 

^.(Tout.e + hf) = Xi,±(x, a, /i) + e^-±("'"'^)Vi,e, 

where hf was defined in (j5.2p ; 

(b) as e — ;> 0, 

Law(V'i,e|^j,±,£) => -^j,±(a;,a,//); 

(c) 



(4) For any r > 0, there is T[r) such that, as e ^ 0, 



1,2, i/>2, 
1/ = 2. 



P { sup |X£(t) - >r \ ^0, 

I 0<t<T(r) I 



P< sup lX£(t)-Zi| >r ^ ^0, 

I T(r)<t<Tout,,-r(r) I 



P ^i,±,£ n i sup - 5*-'^-*-^(?i,±| > r n ^ 0. 

\ lrout,£-T(r)<t<Tout,E+^* J / 

(5) For any r > 0, as e — > 0, 



p L4,,±,£n<^ sup |x,(rout,£ + /if + t)-5*x,,±| >rU ^0. 

V [o<t<(lne-i)i/2 J y 
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Part ([T]) of the lemma describes the asymptotic behavior of exit times. 
Part ([2]) determines the hmiting probabihties of choosing each of the two 
outgoing heterochnic orbits. Part ([3]) describes the entrance distribution for 
the next visited saddle point (it takes Tout,e + hf^ to reach its neighborhood); 
parts (a) and (b) give the asymptotic scaling law, and part (c) ensures 
that we can apply this lemma at the next saddle, too, which gives rise 
to the iteration scheme (see the definition of luj). Part formalizes the 
fact that for small e, with high probability, the diffusion trajectory first 
closely follows the deterministic trajectory S^x, then spends some time in a 
small neighborhood of the saddle point, and then follows closely one of the 
outgoing heteroclinic connections until it reaches the neighborhood of the 
next saddle point. Part ([5]) shows that after reaching the neighborhood of 
the next saddle point the trajectory continues to follow the same heteroclinic 
connection for a sublogarithmic time. 

The proof of this Lemma will be given in Sections [8] and [9j The solution 
spends most of the time in the neighborhood of the saddle points and in 
beetween it travels from one saddle point to another along a heteroclinic 
connection. We split the analysis in two parts accordingly. In Section [8] we 
describe the behavior of the system in the neighborhood of the saddle point 
assuming that the initial data is given by ()7.2p . In Section [9] we describe the 
motion between neighborhoods of two saddle points and finish the proof of 
Lemma 17. 11 

The rest of this section is devoted to the derivation of our main result 
from Lemma l7. 11 

Proof of Theorem I5.il We have to show that the map ^ constructed in 
Lemma |7. 1 1 satisfies the statement of the theorem. 

For any sequence z = {6q, Zi-^,6i, Zi^,62 ■ ■ ■ ,6k-i, Zif^,9k) and any s we 
define a sequence of stopping times in the following way. First, we set 

7"e,z,l,in = i{xo), 

where i(xQ) was defined in ()5.8p . Then, for j = 1, . . . , /c, we recall that 
was defined in (j7.ip and define recursively 

mf{t > r,,,j-in : X,(t) G [/+}, 9^ = 7+, 

Te,z,j,out + h+, Oj=lt,^ 
re,z,j,out + K^, Oj=li^. 

Less formally, for each j, the process Xf, leaves the neighborhood of 
at time z out and travels for time tf along one of the two heteroclinic 
connections 6j = ^fi emerging from Zi.. At time T£^z,j+i,in the solution is in 
the neighborhood of the next saddle point of the sequence, close to xf.. 
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The last point of the sequence z is special. We define now 

_ r^,z,fc+i,in+(ln£~^)^/^ 
In(e-i) 

and 

=£-1 A inf T,,^. 

The term is introduced to make finite (to take into account the 
improbable case where does not evolve along any z € L) so that the 
curve Tz^(t),t<Ts is well-defined. 

According to Theorem 2.1 in [3j, it suffices to show that for any open set 

G G X, 

hminf P{r^^(,),,<:r^ e G} > M^,,l{G), 

where the probability measure M^q^l is defined by (]5.12p . Since the mea- 
sure MxQ,L is discrete, it is sufficient to check that for any z G L and any 
open set G C X containing r(z), 

(7.4) lim P{r^^(,),,<r^ EG}>vr(z). 

We start with the linearization along the orbit S^xq. It follows from |4] 
(see also Lemma l9.2p that 

Xe{i{xo)) = xo + ecpe, 

with Law(0£) ^ where fiQ was defined in ()5.7p . 

Now the strong Markov property allows to apply Lemma 17.11 iteratively 
along the saddle points of sequence z. We can complete the proof by 
reparametrizing the heteroclinic connections of z appropriately and applying 
the following proximity criterion to derive (j7.4p . 

Lemma 7.2. Suppose that 6 is a positive number and a path 7 defines a 
piecewise constant curve T as given by (14. 3p . Suppose a continuous func- 
tion f and nondecreasing nonnegative number sequences {rm)m=o '^"'^ i^m)m= 
satisfy the following properties: 

(1) tj - 5 < r2j < tj < r2j+i <tj + 5, j = 0,... , k; 

(2) \f{r) - yj\ < 6 forj = l,...,kandr£[r2j^i,r2j]; 

(3) S2j < S2j < S2j+i < S2j+i, j = 0,...,k; 

(4) for each j = 0, . . . ,k, there is a nondecreasing bijection 

Aj : [r2j,r2j+i] ^ [^j.^j+i] 

such that 

|7(A,(r)) - {r,f\r),...,f\r))\ < 5, r G h^ra.+i]; 

(5) for each j = 0, . . . ,k, 

his) -7(s2i)! <S, S e [s2j,S2j], 
his) - 7(s2j+l)| <S, S e [S2j+1, S2j+l]- 

Then pir,Tf) < 36. 
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The proof of Lemma 17.21 is given in Section [12] □ 

8. The system in the neighborhood of a saddle point 

In this section we fix i G C and consider a saddle point Zi. We recall our 
assumption that the dynamics generated by 6 in a small neighborhood Ui 
of Zi is conjugated by a C'^-diffeomorphism fi to that generated by a linear 
vector field generated by a matrix Ai in a neighborhood of the origin. In this 
section we often denote Ai, fi, etc. by A,f, etc., omitting the dependence 
on i. In particular, the flow generated by the linearized vector field is given 
by e*"^ and sometimes will be denoted by S^. 

We recall that the eigenvalues Ai, . . . , A^^ of A are real and simple, and 
there is a number v > 2 such that 

Al > . . . > X,y-l > > Xu > . . . > Xd- 

We denote the associated eigenvectors by vi, . . . ,Vd, and introduce the co- 
ordinates , . . . ,u'^ of a vector u via u = Ylk=i '^^'^k- We define 

L = span{v2, . . . , Vd}, = span{f,,, . . . , Vd}, 

and denote hy Hl the projection on L along vi. 

We begin the analysis with the derivation of the Ito equation for Y^{t) = 
f{Xf,{t)). The Ito formula gives: 

d 2 

k=l k,l=l 
d d 



k=l k=l 

2 



k,l=l 

where a* denotes the transpose of a, and ((T(T*)'^'(x) denotes (cr(x)cr*(a;)))^'. 
Since the pushforward of the vector field b under / at a point y is given 
by Ay, we have 

Ay = Df{g{y))b{g{y)), 
where g = f^^, and the equation above rewrites as 

d d 

dYi{t) = Y,AiY,Ht)dt + eY,dkf{g{Ysm^i{9{YemdW"^{t) 

k=l k=l 
.2 d 

+ y E i<^cT*f{g{Ysmdkif{9{Ysmdt. 

k,l=l 

We rewrite the last identity as 
(8.1) dY,{t) = AYe{t)dt + eB{Y,{t))dW{t) + e^C(Ye{t))dt, 
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or, equivalently, 

dYj{t) = A{Y^^{t)dt + eB^Ys{t))dW{t) + e^C^ {Y,{t))dt 

d 

= A{Y^{t)dt + e Bi{Y,{t))dW\t) + e^C\Y,{t))dt. 

k=l 

We see that B and C are continuous and bounded in the neighborhood f{U) 
of the origin, and B is nodegenerate. 

It is also clear that if we assume (17. 2p . then 

(8.2) Y,{0) = yo + e''^e, 

where yo = f{x), and converges, as e — > 0, in distribution to = 
DF{xq)(I)q, with Law((^o) = A*) see ()7.3p . 

Recall that for the saddle point z = Zi, two neighborhoods U C U are 
defined. Define V = f{U) and V = f{U), so that 

y = {2/GR^: |y^|<i?,i = l,...,(i}cy, 

(we use the notation R = Ri, U = Ui, U = Ui in this section). Then 
yo ^ V n . In particular, yQ = for all k < u. 

In the remainder of this section we study the system (jS.ip with initial data 
given by ()8.2p . The solution is actually defined up to a stopping time tv,e 
at which the solution hits dV. Let us define another stopping time 

te = inf{t > : \YXt)\ = R} A tv,e- 

As we shall see later, 

P{te < tv,e} ^1, e ^ 0, 

and thus it makes sense to study the asymptotic behavior of and Y^te). 

To state our main result on system (j8.ip . (|8.2p . see Lemma 18.11 below, 
we have to introduce a certain multidimensional distribution that is easier 
to describe in terms of random variables defined on some sufficiently rich 
probability space. 

We start with the random vector introduced after (j8.2p . Then, on the 
same probability space we define a — l)-dimensional centered Gaussian 
vector {Nq, . . . , Nq~^) with covariance 

POO 

(8.3) EN^N^,= e-^^^+^^>{BB*)'''{S%yo)ds, k,j<u, 

Jo 

independent of ^o- Next, we define (x^, . . . , x'^"^) by 

(8.4) x'= = eo^ + <l„=i, k<u. 

Finally, on the same probability space we define a random vector {N^ , . . . , J 
Conditioned on each of the two events {sgn(x^) = ±1}, it is a centered 
Gaussian vector with 

fO 

ENI^N^= e-^^^+^^>{BB*)''^{S%{±Rvi))ds, k,j>u, 

J —oo 
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and independent of >c'^ , . . . , >c'^~^). 

Lemma 8.1. Suppose the following nondegeneracy conditions are satisfied: 

+ 0, 

P{eo7^0} = l, i/a<l, 
P{^2 _^ 0} = 1, ij a< \ andv>1. 

Then there exists a number P, and a random vector (y',^',C) such that the 
random vector 



(Y,\t,), e~''ULUt,), i.-f In- 



converges in distribution to {y',^',C)- More precisely, 
1, 



a 



V = 2 and — aX^ > Ai, 
u = 2 and — aXi, < Ai, 
, 1/ > 2 and - A^ > Ai - A2, 



—a 



Ai ■ 



u > 2 and — X^, < Xi — A2, 



and 



(8.5) 



with 



and 



y' = sgn(x^)i?, 

iV, 1/ = 2 and — aXi, > Ai, 

N + rj^, = 2 and — aXi, = Xi, 

?7_, = 2 and — aXi, < Ai, 

77+, u > 2 and — A,/ > Ai — A2, 

77+ + r/-, u > 2 and — Aj/ = Ai — A2, 

r/_, > 2 and — Xi, < Xi — X2, 



— I 11-— 2 
r/+ = R^i \k \ ^1 X f2- 



Vk- 



k=i' 



Remark 8.1. Even if the nondegeneracy assumptions do not hold, a version 
of this lemma still holds true. This will be obvious from the proof, and, for 
brevity, we omit a variety of related results on these degenerate situations. 
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Remark 8.2. We see that of ah random variables A^, involved in the 

description of the limit, conditioned on sgn(x^), only N does not depend 
in any way on the initial distribution data given by yo, a, and ^o- This 
guarantees the Markovian loss of memory for the case [v = 2; —aXu > Ai], 
and potentially leads to non-Markov situations in all the other cases, see 
Section [10] for further discussion. 

The proof consists of two parts. The first part provides the analysis of 
the evolution of mostly along the stable manifold. The second part is 
mostly responsible for the motion along the unstable manifold of the origin. 

Using Ito's formula it is easy to verify that Duhamel's principle holds: 



Jo Jo 



{s))ds, 



or, equivalently, 
(8.6) 

Y^^t) = e^"* (^y^ + e^^e + ^ e-^^'B''{Ye{s))dW{s) + e~^^'C''iYe{s))ds 

We start with a study of the outcome of the evolution of along the 
stable manifold. We fix a number a G (0,a). Our first goal is to analyze 
the distribution of ye(re), where 

Ts = mm{T^, k = 1,. . . ,d}, 

and for every k = 1, . . . ,d, 

T^ = mf{t:\Y^\t)-iS'^yo)'\=e^}. 

Lemma 8.2. 

sup \YeiTe At)- (S'^^^ yo)\ 4 0, E ^ OO. 
t>0 

Proof. This statement is obvious due to 

(8.7) sup \Y,{t, At)- {S'^^^yo)\ < de", 

i>0 

which follows from the definition of r^. □ 
Lemma 8.3. 

p 

Tg — >■ oo, e — >■ oo. 
A proof based on (j8.6p is given in Section [TTl 
Lemma 8.4. 

P{Te = T^}^1, e ^ OO. 

A sketch of a proof is also given in Section [TTJ 



NOISY HETEROCLINIC NETWORKS 



21 



Let US now take a closer look at the stochastic integral term in the ex- 
pression (I8.6p . For k < V we introduce 



f e-^>''B''{Y,{s))dW{s) = V / e-'"''Bi{Y,{s))dW'^{s) 
Jo 



and 



m=l 



B\S%yo)dW{s) = / e-^'''Bi{S%yo)dW"'{s). 



A straightforward application based on BDG inequalities (see e.g. [6| The- 
orem 3.28, Chapter 3]), local Lipschitzness of B, and ()8.7p implies that, as 
e ^ 0, 

ft 



sup 



N^{t)-M''{t) 



sup 



e-^'''{B'^{Y,{s)) - B'{S%yo))dW{s) 



k I' OS , 



0. 



Lemma [8.3l implies that the terminal value Ni:{ts) converges to M(oo). Com- 
puting EM^(oo)MJ(oo), we see that Law(M(cx))) = Law(A'o), where A'^o is 
a centered Gaussian vector defined in (18.31). Therefore, 



We also notice that 



sup 



e' I e~^^'C''{Ye{s))ds 



op(e), 



where we use the notation (f){e) = op(^(e)) for any families of random vari- 
ables (l){e),'il;{e),e > such that A as e — > 0. 
Therefore, for k < ly we have 

(8.9) Y,'{T,) = e''^^^{e''C^ + eN^{T,) + op{e)). 
We denote 

x,^ = y,^(T,)e-^'=^^^e-", 

so that 

x,^ = 4^ + ei-"iVi=(T,) + op(ei~"), 

and rewrite ()8.9p as 

(8.10) Y^^{Te) = e^'=^^e°x^ 
Lemma 18.41 implies that 

(8.11) p{\Y^\r,)\ = e'^} ^ I, 

So, taking logarithms of 



we see that with probability approaching 1, as e — > 0, 



a — a 



In lx_ 
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and 

(8.13) Y,\r,)e-^ = sgni><l), 

Plugging (I8.12|) into (jS.lOp . we obtain the following result: 
Lemma 8.5. For 1 < k < u, 

This lemma describes the asymptotics of Y^{T£),k < v very precisely 
since due to (I8.12P we have the following obvious statement: 

Lemma 8.6. 

( t- a- a 1 Law ln[x^| ^ A 
[L^Te ^lne,x^\...,< M ^ Uo, M, 

where the random variables in the r.h.s. were defined before the statement 
of Lemma \8.1\ 



We shall now consider k >v. For these values of /c, we denote 

= e^^* f e-^>''B^{Ye{s))dW{s). 
Jo 

Let us take any numbers T G N and p > 0, and use BDG inequalities to 
write 



P I sup \R'',{t A Te)! > e^pj < V P I sup R^{t A t,) 

[selO,T] J [teln,n+l] 

< V P <^ sup / e-^'''B''(Yeis))dW{s 

„-n U<n+1 JO 



n=0 

(8.14) < KTe-'^P, 

for some -fC. 
Since the term 



e-^^'C'{Ys{s))ds 







is bounded on t < r^, we see that (|8.6|) . (18.121) and (|8.14|) imply the following 
result: 

Lemma 8.7. For all k >u and any p > 0, 

Lemmas 18.51 18.61 and 18.71 provide all the necessary information on the 
behavior of 1^ up to r^. We now turn to the second part of the proof, the 
analysis of the evolution of Y^ after r^. 
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Let Ye{t) = Ys{Te+t). We shall study % conditioned on ^^(0) = Yj-{Te) = 
ibe°. We consider only the case of since the analysis of the other case 
is entirely the same. First, we rewrite (18.61) for Y^: 
(8.15) 

Y^k{t) = e^^* (y^H^s) + e e-^^'B^{Ye{s))dW{s) + ^ e~^'''C^{Ye{s))ds^ 

Let Te = inf{t : \Y^ \ = R}. We can rewrite (|8.15p for A; = 1, t = -fj as 

(8.16) i;i(f,) = e^i^=e'^(l + 7y,), 
with 

i]e = e^"" r e-^^^'B^{%{s))dW{s) + e^"" P e-^''C^{Ye{s))ds ^ 0. 
Jo Jo 

The last relation is obvious if B and C are bounded. In the general case it 
follows from a localization argument. 
Relation (j8.16p implies 

(8.17) f, = -^lne+ ^In- ^ 



Ai Ai 1 + r]e 

Plugging this into (I8.15P and applying Lemmas 18.51 and 18.71 we can prove the 
following statement: 

Lemma 8.8. 

(8.18) sup\Ye{t)-S\(e^vi)\^0. 

However, we need a more detailed information on l^(fg). To that end, we 
analyze Y^{f£) separately for 2 < k < u and u < k < d. 

For 2 < k < v, plugging (18.171) into (IS.lSh . using Lemma 18.51 for the first 
term and Lemma 18.81 to estimate the integral terms, we see that 



^.19) 



= A^)^A^^fc|^i| aJ(i + op(i))^ 2<k<l^. 

For k > I' we denote 

= e^"^- r e-^'''B\Ye{s))dWis) 
Jo 

Lemma 8.9. As e ^ 0, 

(iv,^...,iv,'^)^A"(iVo^...,iVo^), 

where {Nq , ■ ■ ■ ,Nq) is the centered Gaussian vector defined before the state- 



ment of Lemma \8.1 
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Proof. Denoting 

we obtain 
and 



, a 1 

= — — In e + — In i? — > oo, 
Ai Ai 



Tg — > CXO, £ — > 

- r' 4 0, e ^ ( 



Therefore, 

f-ri 

i.20) - e^''^ / e~^>^'B\Y,{s))dW{s) 4 0. 

^0 



It follows from Lemma 18.81 that 



r' 

.21) e^^^^ / e-^>^'B^{Y,{s))dWis) - 0, 

Jo 



where 



N^ = e^^< f \~^'^'B^{S%e^vi)dW{s), k>v. 
Jo 

We see that {N^,...,N^) is a centered Gaussian vector with 
Jo 

rO , 
fO 

(8.22) ^ / e-(^^+^^)"(SS*)'=J(5:4-R7;i)(ir, e ^ 0, 

J — oo 

where the second indentity follows from the change of variables s — = r, 
and the convergence in the last line is implied by the uniform convergence 
S^j^^'e^vi S\Rvi, r < 0. 

Lemma EJ] follows now from (l8:20|) - (18:221) . □ 

Now, for k>u^ equations ()8.15p . (|8.17p . and Lemma 18.71 implv: 

Y^{T,) = e<~^^''''^^.'''^) (^e^(^--)|xir^(yo^ + 5%^) + op(ei-P)) + sN^^ + op{e) 
(8.23) 

= e-^"i?M Ix^r M y^(i + op(i)) + op(e'"^"^") + eN^ + op(e). 



We are now ready to finish the proof of Lemma 18. 1[ We notice that 
formulas analogous to (I8.19P and (j8.23p hold true if we condition on 1^/(0) = 
Y}{ti.) = — e". Since Y^ife) = Yi;{te) and tg = Tg + fg, Lemma [8.11 is a 
consequence of the strong Markov property and an elementary analysis of 
relations (l8l2]l . (f8T7l) . (f8l9]) . ([8:23]) . The proof reduces to extracting the 
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leading order terms in ()8.19p . (18.230 in each of the cases that appear in the 
statement of the lemma. 

For example, in the case of > 2 and —X^ < Ai — A2, the greatest 

af 1 '^^ ) 

contribution (as e ^ 0) in (jS.lOp is of order of e ^1 and corresponds 

to k = 2. The greatest contribution in (j8.23p is of order of e ^1 and 
corresponds to k = i' since one can choose p sufficiently small and neglect 

the asymptotic contribution of the op{e ^ ^1") term. (Notice also that 

in this case the op(e) term can also be neglected since e = o{e ^1 ) due 

to a < 1 and — ^ < ^^^^^ ^ -'-■) Among these two contributions, e ^1" 
dominates providing the desired asymptotics 

The analysis of all the other cases is similar, and we omit it. 

Having Lemma [8. II at hand, it is straightforward to write down the asymp- 
totics for the original process ^^(te) = g{Yfr(ti;)). 

Lemma 8.10. Let solve the system ()1.2p with initial condition ()7.2p and 
assume (x, a, //) € Inj. Let us define x^, via ()8.2p . ()8.3p . (j8.4p . and (3, ^' , ( 
as in Lemma \8.1\ 
Then 



(8.24) X,{t,)=q, + e^(t>',, 

where 

P{qe = g{±R)} = l, £>0, 

and 

Here 
and 



leiYei 



t,-^lni)^A"((Z,<A',C), e^O. 



f = {Df{q))-'i'. 

Remark 8.3. The case of a = 1 and = is also covered by this lemma. 
This situation corresponds to the deterministic initial condition for all e > 0. 

9. Asymptotics along the heteroclinic orbit 

In this section we consider the equation (|1.2p . equipped with the initial 
condition 

X,(0) = xo + e"(^e, 
on a finite time horizon (up to a nonrandom time T). Here a € (0,1], 
and (</>£) £>o is a family of random vectors satisfying 

6r_ ^-T (hn, e ^ 0, 
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for some nondegenerate random vector (f)Q, independent of the Wiener pro- 
cess driving the equation. 

The fohowing statement is elementary. 

Lemma 9.1. As e ^ 0, 

sup \Xeit) - S^Xol 0. 

te[o,T] 

We are going to give more precise asymptotics in the spirit of [4] . To that 
end we denote 

Y{t) = X{t) - S'xo, 

and write 



y,(t) = + / (6(X,(s) - biS'xo)))ds + e / aiX,is))dWis) 
Jo Jo 

= e''<Pe+ [ Db{S'xo)Y,{s)ds+ [ Q{S'xQ,Ye{s))ds + e I a{X,{s))dW{s), 
Jo Jo Jo 

where 

(9.1) \Q{x,y)\<Cy^, 

for some constant C and all y with \y\ < 1. Treating this as a linear nonho- 

mogeneous equation we apply the Duhamel principle to see that 

(9-2) 

Y,{t) = e''^,,{t)^,+e1>,,{t) [ '^~^\s)a{X,{s))dW{s)+'^,,{t) [ ^-^\s)Q{S'xo,Y,is))ds. 

Jo Jo 

We introduce a stopping time by 

r, = inf{t: \Ye{t)\ > eK^^")}. 
On the event {r^ < T} we use (19. 2p to write 
(9.3) Ye{t ATe) = Ii{t,e) + l2{t,e) + h{t,e), 

with 

\h{t,e)\ = |e"^>,o(tAr,)0,| <i^ie"|</.,| 
for some constant Ki, 



\l2{t,e)\=e 



rt/\Te 

$,o(tAr,) / <^>-^\s)a{X,{s))dWis] 
Jo 



where A^^ is a tight (in C with sup-norm) family of continuous processes due 
to Lemma 19.1^ and, finally, 



for some constant due to (|9.ip . It now follows from (|9.3p for t = T, that 
on {Te < T} 

el(iAa) < +e|iV,(r,)| +7^3^1(1^"), 
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which automaticahy imphes 

PK < T} ^ 0, e ^ 0. 
On the complement, {r^ > T}, we have 

(9.4) UT) = l[{e) + I^{e) + lUe), 
where 

(9.5) I[{e) = e'''^,,{T)(^s, 

(9.6) /^(e) = e<l>,,iT) <i>-^\s)aiX,is))dWis), 

Jo 

(9.7) |4(e)| < K4e^(^^") 
Now (j9.4p — (j9.7p imply the following result: 

Lemma 9.2. For every e > 

X,(r) = 5^xo + e"<^e, 

where 

7 Law 7 „ 

with 

N being a Gaussian vector: 

N = ^^,{T) r '^-^\s)a{S'xo)dW{s). 
Jo 

Remark 9.1. The lemma also holds true for the case where a = 1 and 
(j)^ = 0. This situation corresponds to the deterministic initial condition for 
ah e > 0. 

We can now finish the proof of Lemma 17.11 
Proof of Lemma\7.1\ Parts [Hand [2] follow directly from Lemma [8. 101 Part [3] 



follows from consecutive application of Lemmas 18.101 and 19.21 together with 
Strong Markov property and the nondegeneracy assumption (]5.5p . The last 
two statements of the Lemma are obvious in view of the analysis above, and 
we omit their proofs. □ 

Remark 9.2. Notice that not only we are able to prove that the entrance- 
exit map with desired properties exists, but we also can describe this map 
explicitly, tracing the details from Lemmas 18.11 and 19.21 For example, 

Vi,±{x,a,^j) = P{sgn(x^) = ±1}, 

where is the random variable constructed in Lemma 18.11 applied to the 
linearization about z,-. 
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10. Discussion 

In this section we informally comment on implications of our main result 
and its possible extensions. 

10.1. The structure of the limiting process. According to our main 
result, the limiting probability for the solution of the system to evolve along 
a sequence of heteroclinic connections is computed by a recursive procedure 
described in (fSlBjl - lfSJO]) . 

The key result for the analysis of the Markov property for the limiting 
process is Lemma 18.1^ which gives the "exit distribution" Law(^') via the 
"entrance distribution" Law(^o)- 

Analyzing (|8.5p . we see that in the case where for each z G C we have 
Ui = 2, and —\i^2 > (i-e. contraction is stronger than expansion) our 
iteration scheme gives that at each saddle point the limiting process chooses 
each of the outgoing heteroclinic connections with equal probabilities, thus 
defining a simple random walk on the network viewed as a directed graph. 

However, in general, the limiting distribution on sequences of saddles does 
not necessarily define a Markov chain. In fact, if the exit distribution defined 
in (jS.Sp involves rj-, then it is asymmetric. Therefore, at the next saddle 
point, the choice between the two heteroclinic orbits is asymmetric as well, 
and probabilities are generically not equal to 1/2. In fact they may equal 
or 1. 

Let us now look at 77+ , the other random variable involved in the exit 
distribution. If the entrance distribution is symmetric, then rj^ also has 
symmetric distribution. On the other hand, if the entrance distribution is 
asymmetric, then r/+ is also (typically) asymmetric. 

Notice that there are plenty of situations where the entrance distribution 
is strongly asymmetric (concentrated on a semiline) and the probabilities to 
choose one of the two outgoing connections are and 1, i.e. the choice is 
asymptotically deterministic (and dependent on the history of the process). 

So, there are many possibilities, but roughly the limiting random walk 
on the saddles may be described as follows. The system starts evolving in 
a Markov fashion (choosing the next saddle out of two possible ones with 
probabilities 1/2 independently of the history of the process) until it meets 
a saddle point at which the exit distribution becomes asymmetric. After 
that, the choice of one of the two heteroclinic connections is not Markov 
any more (being defined by the entrance distribution whose asymmetry is in 
turn defined by the history of the process.) Then, at each saddle point the 
following three things may happen: (i) new asymmetry is brought in due 
to the presence of r/_ in the exit asymptotics; (ii) the asymmetry present 
in the entrance distribution is transferred to the exit distribution by 77+ ; 
(iii) u = 2 and the contraction is strong enough to ensure that the exit 
distribution does not involve rj- or rj^ thus being symmetric. In the first 
two cases the system remembers its past encoded in the asymmetry of r]±. 
However, in the last case, the system loses all the memory and goes back 
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to the "Markov mode" (which is just a convenient name for this phase of 
the system's evolution; of course the system is not truly Markov since some 
information from the past is encoded in the fact that the system is in the 
"Markov mode" presently.) 

In particular, if every entrance distribution involved is either symmetric 
or strongly asymmetric (this excludes the "rare" cases of \y = 2; —a\y = Ai] 
and [v > 2; —X,y = Ai — A2]), then the two splitting probabilities are either 
both equal to 1/2, or equal and 1 respectively. 

It is instructional to find the limiting exit measure for the 2-dimensional 
example shown on Figure [H where the domain contains three saddle points 
zi,Z2, and Z3 . Let us assume that the linearizations at these points have the 
same eigenvalues Ai > > A2. According to Theorem l6.lt the exit measure 
will weakly converge, as e — t- 0, to 

where yi, ... ,1/4 are the points on the boundary that are reachable along 
the network. 

If — A2 > Ai then we have Markov evolution along the network resulting 
in pi = 1/2, p2 = 1/4, p3 = p4 = 1/8. 

If — A2 < Ai, then the exit distribution at each saddle point is strongly 
asymmetric which results in deterministic motion after the first bifurcation 
so that pi = 1/2, p2 = 0, P3 = 0, P4 = 1/2. 
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If — A2 = Ai, then the exit distribution N + rj- at each saddle point is 
asymmetric, so that the evolution is not Markov, and pi = 1/2, < p2 < 
1/4, < p3 < p4. 

10.2. Dependence of the exit measure on the diffusion coefficient. 

Suppose that the vector field b is given. From our basic iteration procedure 
we see that if for some matrix-valued function a at each saddle point the 
entrance distribution is either symmetric or strongly asymmetric, then for 
any other choice of diffusion coefficients, the situation is the same, so that 
the limiting process given by Theorem 15.11 and the exit measure asymp- 
totics given by Theorem 16.11 do not depend of a. Therefore, these theorems 
describe an intrinsic property of the vector field b. 

10.3. Our set of assumptions and possible generalizations. We begin 
with an obvious and important extension of our theorem that we are going 
to mention without any proof. We assumed so far that every critical point in 
the network is a saddle with nontrivial stable and unstable manifolds. Let us 
see what happens if we allow situations where some of the critical points of 
the network are actually attracting nodes (this corresponds to Mj = 1, where 
i/j is the order number of the first negative eigenvalue of the linearization 
at Zi). It follows from the FW theory that if the initial condition belongs 
to a neighborhood of such a point, then with probability approaching 1 as 
e — )• 0, the trajectory will not leave that neighborhood within the time of 
order ln(e~^). Therefore, our main result holds true for this situation as 
well, with the following correction: with probability 1, for every i £ C with 

= 1, if the limiting process reaches Zi, then it stays there indefinitely. 

Another essential assumption that we have used is that near each sad- 
dle point the dynamics is C^-smoothly conjugated to a linear flow. Al- 
though this assumption leaves aside a number of interesting systems (one 
example is 2-dimensional Hamiltonian dynamics), it is not too much re- 
strictive. Namely, a smooth conjugation to linear dynamics is possible for 
saddle points satisfying so called no resonance conditions, see a discussion 
of local conjugation to a linear flow in [71 Section 6.6]). In most examples 
of heteroclinic networks in the survey [9] all the saddles satisfy the no reso- 
nance conditions for almost all parameters involved in the description of the 
system. 

Although our proof relies entirely on the analysis of the conjugated linear 
systems, we believe that our results can be extended to the situation with 
local dynamics that has no smooth linearization. The most promising way 
to approach the understanding of that situation is to use normal forms (see 
[3 Section 6.6]). 

Our assumption that all the eigenvalues of matrices Ai are real and simple 
can be weakened. We could have actually assumed that Ai,A2,Ai/ £ M are 
simple, and 

Ai > A2 > KA3 >...>0>Aj,>...> KAt.+i > 
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Obviously, the case where the eigenvalue with greatest real part has a 
nonzero imaginary part leads to a completely different picture since the in- 
variant manifold associated to Ai is two-dimensional in that case. The case 
where A2 or are not real has a lot in common with the case considered in 
this paper, since for small values of e the system tends to evolve along hete- 
roclinic connections. However, instead of a unique limit distribution for Z^, 
there may be a set of limit distributions, i.e., the attractor in the space of 
distributions may be non-trivial. The reason is that due to the rotation 
associated with the imaginary part, the exit distribution at a saddle point 
is not well-defined. An analogue of Lemma 18.11 holds true, but the normal- 
ization by should be replaced by a more sophisticated one involving a 
rotation by an angle proportional to ln(e~^). Even more complicated situa- 
tions arise if, instead of connecting hyperbolic fixed points, the heteroclinic 
orbits connect limit cycles or other invariant sets. 

The last assumption we would like to discuss is the assumption that the 
set L is conservative. Notice that Theorem 15 . 1 1 does not necessarily describe 
the asymptotic behavior of the diffusion up to infinite time. Indeed, times 
spent at subsequently visited saddles can become smaller and smaller so that 
the total time accumulates to a finite value. There is no unique deterministic 
time horizon that serves all limiting trajectories at once, since these values 
may differ from sequence to sequence. If one chooses L to be conservative, 
then one can find a valid trajectory-dependent (therefore, random) time 
horizon serving all sequences from L. 



11. Auxiliary lemmas 

Proof of Lemma \8.3[ We prove that A cx) for every k. Due to (j8.6p . for 
any T > 0, 

(11.1) P{ri= <T}< P(^i(e,r)) + P(A2(e,r)) + P(A3(e,r)), 

where 



A,{e,T) = \ e%iy e'^'^'^)\ee\>'- \ , 



A2{e,T) = |re < T; 3t < r^, : £6^*=* 

^3(e,r) = |re <T; 3t < t, : e^^' 
Now 

P(^i(e,r))<p||e,1> 



f e-^^'B\Ye{s))dW{s] 
Jo 



e'^>='C''{Yeis))ds 



> 



4(1 Ve^*^) 



^ 0, e ^ 0, 
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To estimate A2{e,T), we can write 



P{A2{e,T))<p\ sup \N^{t)\>-^^—\=O{e'^'-^^)^0, 
t<T,AT 4(1 V e^fc-' ) 



which fohows from consecutive apphcation of the Chebyshev inequahty, 
BDG inequahties, and an elementary estimate on the quadratic variation 
of which was defined in (jS.Sp . 

Next, we notice that for sufficiently small e, 

P{As{£,T)) < p|(l Ve^'=^)rC7* > ^1 =0, 

where C* = sup |C(y)|, which completes the proof. □ 

Sketch of a proof of Lemma \8.4\ The proof is similar to the proof of Lemma [8.3l 
and uses the fact that among the factors 6^*=*, the one with k = 1 grows 
fastest of all in t, and t is large due to Lemma 18.31 so that the exit level e° 
is first reached by — (5'*yo)^| with high probability. □ 

12. Proof of properties of the space of curves X 

Any nondecreasing continuous map of [0, 1] onto itself is called a time 
change. For a time change A we define its inverse by 

X-\s) = mf{t : X{s) > s}. 
The inverse function is nondecreasing and satisfies 

XoX-\s) = s, sG[0, li- 
lt is continuous on the left and has limits on the right, but it may have jumps 
associated with segments of constancy, and therefore it is not necessarily a 
time change. Nevertheless, the following lemma holds true. 

Lemma 12.1. Let Ai and X2 be two time changes. Suppose that each seg- 
ment of constancy of Xi is contained in a segment of constancy of X2- Then 
X2 ° Xi"^ is a time change. 

Proof. The function A2oA^^ is obviously nondecreasing with A2oA5'^(0) = 0. 
Let us prove that it is continuous. Take any point s € [0, 1] and let 

[t^,t+] = {t:Xi{t) = s]. 

Since A^^(s+) = t+ and A|f^(s— ) = t-, we have 

X2oXl\s+) = X2{t+) 

and 

X2oX^\s-) = X2{t-). 

By the assumption of the lemma, A2(i+) = A2(t-). Therefore, 

A2 o A7^(s+) = A2 o Aj;^(s-), 
and the continuity is proven. 
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If s = 1, then, using the same notation and reasoning we see that = 1, 
and 

A2°Ar'(l-)=A2(t-) = A2(l) = l, 
and the proof is complete. □ 

Lemma 12.2. For any nonconstant path 7 there is a path 7' with no inter- 
vals of constancy, and a time change A so that 7 = 7' o A. 

Proof. Let Ii,l2, ■ ■ ■ be maximal nondegenerate segments of constancy of 7. 
By assumption, none of these segments coincides with [0, 1]. Therefore, there 
is a time change A that has these and only these maximal nondegenerate 
segments of constancy (A may be constructed analogously to the Cantor 
staircase). It is easy to see that 7' = 7 o A^"*^ is continuous. 

Let us assume that there is a nodegenerate segment [s_,s+] and a point 
X, such that j'{s) = x for all s € [s_,s+]. This means that 7(t) = x for 
all t € A^^([s_, s+]). Now notice that A^^([s_, 5+]) consists of the left ends 
of all maximal (not necessarily nondegenerate) segments of constancy of A 
(and thus of 7) contained in = [A~^(s_), A~^(s+)]. So, for the left 

end t of any interval of constancy contained in we have 7(t) = x. 

Therefore, in fact, 7(t) = x for all t G We conclude that is 

contained in some segment of constancy of A, so that 

s_ = X{t_) = A(t+) = s+, 

which contradicts the nondegeneracy of [s-, This finishes the proof that 
7' = 7 o A""*^ is a curve with no intervals of constancy. To see that 7 = 7' o A, 
we notice that for each s, 

A-i o A(s) = inf{so : A(so) = A(s)} 

is the left-end of the segment of constancy of A (and 7), containing s. There- 
fore, 

7' o A(s) = 7 o A"^ o A(s) = 7(s), s G [0, 1], 
and the proof is complete. □ 

Proof of Lemma \4.1\ Of all equivalence properties, only the transitivity is 
not quite obvious. So, we assume that 71 ~ 72 and 72 ~ 73 and prove that 
71 ~ 73- In other words, our assumption is that there are paths 712, 723 and 
time changes Ai2,i, Ai2,2 A23,2, A23,3 such that 

71 = 712 o Ai2,l, 72 = 712 o Ai2,2, 

72 = 723 ° A23,2 , 73 = 723 ° ^23,3- 

According to Lemma [12. 2 1 there is a path 7' with no intervals of constancy, 
and a time change A such that 



72 = 7' O A = 712 o Ai2,2 = 723 ° A23,2- 
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This representation implies that segments of constancy of time changes Ai2,2 
and A23,2 are contained in segments of constancy of A, so that Lemma ll2.1l 
imphes that A o X122 and A o A^g^g ^-re time changes. We also see that 

712 = 7' ° ° K2,2^ 
723 = 7'°Ao A^3^2' 

SO that 

71 = 7' o A o X^2,2 ° ^12,1 > 

73 = 7' ° A o A23';2 o A23,2, 

which completes the proof, since the paths 71 , 73 are represented via a com- 
mon underlying path 7' and time changes A o X^2 2 ° -^12,1 and A o A^3^2 ° ^^23, 2; 
respectively. □ 

For a curve F G X, we shall denote by F' the set of all 7 S F with 
no segments of constancy. Lemma 112.21 shows that F' ^ for any curve 
represented by a nonconstant path. 

For a continuous function / : [0, 1] M.'^^^, we use 

I/loo = sup 

se[o,i] 

Lemma 12.3. (1) For every 71 G F and 72 G F', there is a time change 
A such that 71 = 72 o A. //71 G F', then X is a bijection. 

(2) For every 7 G F and every e > 0, there is a js ^ T' such that 

I7 - 7e|oo < e- 

(3) Every curve is a closed set in the sup-norm | • |oo- 

(4) The definition (j4.ip is equivalent to 

(12.1) d(Fi,F2)= inf I71-72I00. 

7ier;,72Gr^ 

Proof. Part [TJ If 71 G F, 72 G F', then there are time changes Ai, A2 and a 
path 7' with 7i = 7' o Ai and 72 = 7' o A2. The map A2 is strictly increasing 
and 7' G F', since if any of these two conditions is violated then 72 has a 
segment of constancy. In particular, A^^ is also a time change, and we can 
write 7' = 72 o X2^ ■ So, 71 = 72 o X2^ o Ai, and we can set A = X2^ o Ai. If 
A is not a bijection, then 71 has an interval of constancy, and the proof of 
part [H is complete. 

To prove part [21 we use part [1] to find 7' G F' and a time change A so that 
7 = 7' o A. For any 5 > 0, Xs{s) = 5s + {1 — <5)A(s) defines a time change. 
Notice that jA^ — A|oo < 2J and 7' o A5 G F'. Due to the uniform continuity 
of 7, 

I7 - 7' o A^loo = I7' o A - 7' o A^loo < e, 
for sufficiently small 6, and we are done. 
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Part [3l We have to prove that if 7„ G F for ah n G N, and Hm„^oo \ln — 
^|oo = 0, then /i € r. Due to part [21 it is sufficient to consider the case 
where 7„ E V for all n. In this situation there exists a sequence of time 
changes A„ and 7 € F' such that 7„ = 7 o A„ for all n. Due to Helley's 
Selection Theorem, see [3, Appendix II], there is a sequence n' — > 00 and a 
nondecreasing function Aqo : [0, 1] [0, 1] such that lim„/_^oo \i'{s) = \oo{s) 
for every point of continuity s of Aqo- This implies 

(12.2) /"(s) = lim 7„/(s) = lim 7 o A„'(s) = 7 o Aoo(s) 

n'— >oo n'— >oo 

for all points of continuity s. 

Our goal is to show that Aqo is actually continuous at every point in 
[0,1]. That will imply that Xn' converges to Aqo uniformly, since all these 
functions are nondecreasing, and we shall be able to conclude that Aqo (0) = 
0, Aoo(l) = 1, so that Aqo is a time change. Moreover, it follows that ()12.2p 
holds for all s € [0, 1], so that = 7 o Aqo and G F. 

So, we assume that there is a point sq G (0, 1) with Aoo(so— ) < Aoo(so+) 
and we are going to show that this assumption contradicts the uniform 
convergence of 7„/ to /i. 

Take any point r G (Aoo(so-), Aoo(so+)) with 7(r) / 7 o Aoo(so+) and 
7(r) 7^ 70 Aoo(-so— ) (this can be done since 7 has no segments of constancy), 
and define 

(12.3) e = ^ min{|7(r) -70 Aoo(so+)|, \l{r) - 7 o Aoo(so-)I}- 

The continuity of 7 and the existence of the right and left limits of Aqo at 
So allow us to choose (5 > so that 

(12.4) sG(so,so + (5) implies I7 o Aoo(s) - 7 ° Aoo(so+)| < e, 

(12.5) sG(so-^, So) implies I7 o Aoo(s) - 7 ° Aoo(so-)| < £• 

Since A„/ converges to Aqo at all its continuity points, and these are dense, 
we see that there is a number no such that for every n' > no, there is a 
continuity point s(n') G (sq — <5, sq + ^) with A„/(s(n')) = r. For these n', if 
s = s{n') G (so, sq + 5), ([113]) and (fT2^ imply 

\ln'{s) - 7 o Aoo(s)| = |7('^) - 7 ° Aoo(s)| 

> \^(r) _ ^ o Aoo(so+)| - I7 o Aoo(s) -70 Aoo(so+)| 

> 2e - e = e, 

s(n') G (so - (5, So), ([TO]) and (fT23]) imply 

7 ° Aoo(s)| = |7('^) - 7 ° Aoo(s)| 

> |^(r) _ ^ o Aoo(so-)l - I7 o Aoo(s) -70 Aoo(so-)| 
>2e-e = £, 

So, for all n' > no there is s G (so — 5,so + 6) such that 

hn'{s) - //(s)| = |7n'(s) - 7 O Aoo(s)| > £, 



and if s = 

hn'is) - 
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and we obtain a contradiction with the uniform convergence of 7„' to /i. 
This completes the proof that there is no discontinuities of Aoo within (0, 1). 
The demonstration showing that Aoo is continuous at the endpoints of [0, 1] 
is similar. This completes the proof of part [3l 

Part [4] follows immediately from part [21 The lemma is proven completely. 

□ 

Proof of Theorem \4-l\ Part 1. We notice first that d is nonnegative and 
symmetric. To show the triangle inequality, for any three curves ri,r2,r3 
and e > we use part HI of Lemma 112.31 to find 71^2 £ P'l, 72, ii 72,3 £ and 
73,2 e with 

|7i,2 - 72,1 loo < d(ri,r2) + e 

and 

|73,2 - 72,3|oo < d{T-i,T2) + e. 
Let us find an time change A such that 72,3 = 72,1 o A. Then 

d{Vx,V-i) < |7i,2 o A - 73,2|oo 

< |7i^2 O A - 72,1 o A|oo + [72,1 o A - 73,2|oo 

< |7l,2 - 72,1 loo + |72,3 - 73,2 loo 

<d(ri,r2) + d(r3,r2) + 2e, 

and the desired inequality follows since e is arbitrarily small. 

If d(ri, r2) = 0, then there are two sequences of paths 71,^ € T'^, 72,n € T'2 
with lim„_^oo |7i,n — 72,n|oo = 0. Using appropriate time changes, we see that 
there is 71 G r[ and 72,n G Tg, n G N such that lim^^oo |7i — 72,™ I = 0. Due 
to part [3] of Lemma 112.31 we see that 71 G r2. Therefore, Ti = r2, and we 
are done with part 1. 

Part 2. The space X is separable since it inherits a dense countable set 
from a closed subset (of functions with nondecreasing zeroth coordinate) of 
the separable space C([0, 1] — >• [0, 00) x R'^). 

Let us now prove that X is complete. Suppose that (r„)„gN is a Cauchy 
sequence in (X, p). Using Lemma 112.31 and reparametrization, we can find 
an increasing number sequence {nk)k<^N, with Uk G N, and a sequence of 
paths (7fc)fcgN with 7^ G T^^ such that — 7fc+i|oo < 2"*^. This sequence 
of paths is Cauchy in C and, therefore, converges to a path 700 which de- 
fines a path Too- Obviously, r^^, converges to Too as A: ^ cxd. One can 
now use the triangle inequality to extend this convergence to the whole se- 
quence (r„). So every Cauchy sequence is convergent, which completes the 
proof of Theorem 14.11 □ 

Proof of Lemma \4-^ We prove the necessity part of the statement since the 
sufficiency part is obvious. Due to Lemma [1231 we may assume that there 
is a sequence of paths 7^ G T^^ such that 

(12.6) |7n-75loo^0, n^oo, 
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where 7^ is defined in (14. 2p . Therefore, 

sup |7^(s) — sT\ — > 0, n — )• 00. 

SG[0,1] 

This in turn imphes that if we define a strictly increasing and continuous 
function Un{s) by 7^(u(s)) = sT, then 

sup \un{s) — sT\ — > 0, n ^ 00. 

sG[0,l] 

This, together with ()12.6p and uniform continuity of g proves that 

sup \fn{sT) - g{sT)\ = sup |7„(n„(s)) - 7g(s)| ^ 0, n ^ 00, 

sG[0,l] sG[0,l] 



and the proof is complete. 



□ 



Proof of Lemma \ 7. 2\ It is straightforward to see that the path 7^ defined 
by 



7/(s) 



r(7°(s),/n7°(^)),...,W(^))), 
(r2„/nr-2i),...,nr2i)), 

. {r2j+i,f^ir2j+i), /'^(r2j+i)), 
belongs to Fj and I7 — 7/|oo < 35. 



7°(s) e [r2j-i,r2j], j = 1,.. 
7°(s) > »^2i, s < S2j, j = 0, 

S G [S2j5 'S2j+l]' J = 0) • • • ) ^ 

s > s'2j+i> 7°('S) < r2j+i, j 

□ 
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